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SUMMARY 


Analysis  of  covariance  in  randomized  and  balanced  incomplete 
block  designs  is  reconsidered  in  terms  of  structural  regression.  In 
practice,  the  covariable  is  usually  uncontrolled  and  may  follow  a  linear 
model  as  does  the  variable  of  interest.  If  the  covariable  and  treatments 
are  independent,  then  the  covariable  model  contains  no  treatment  effect, 
and  the  treatment  contrast  precision  on  the  variable  of  interest  is  in¬ 
creased,  not  only  asymptotically,  but  in  the  finite  sample.  If  the  co¬ 
variable  is  affected  by  one  more  of  the  treatments,  then  the  estimation 
of  direct  and  indirect  effects  is  considered.  Finally,  when  structural 
parameters  are  under  identified,  in  which  case  direct  and  indirect  effects 
are  not  estimable,  an  alternative  estimation  procedure  is  discussed. 


1.  INTRODUCTION 


Consider  the  randomized  block  design  model 


y...  =  iji  +  T,  +  P.  +  ci£y  +  e.. 
lij  1  J  2ij  ij 


(1.  1) 


where  t.  (P.)  is  the  fixed  differential  effect  of  the  i  (j  )  treatment 
^  J 

(block)  on  a  response  of  interest,  say  yjy>  which  is  taken  from  the 
(i,  experimentf  l  unit,  i  =  1,  .  .  . ,  q;  j  =  1,  .  .  . ,  r.  n  is 
usually  the  base  from  which  the  differential  effects  are  estimated,  and 
€..  is  the  random  model  error.  The  covariable,  y_,.,  is  assumed  in- 
dependent  of  treatments  and  is  included  in  model  (1.  1)  to  account  for 


differences  in  the  y^^^  from  one  experimental  unit  to  the  next  in  esti¬ 
mating  treatment  contrasts. 


Consider,  next,  the  following  three  conditions  and/or  assumptions 
which  may  accompany  a  covariance  analysis: 


1.  The  covariable  is  uncontrolled,  and  is  measured  with 
ne^igible  measurement  error. 

2.  The  covariable,  though  independent  of  treatments,  is, 
perhaps,  dependent  on  blocks. 

3.  The  covariable  is  unaffected  by  the  variable  of  interest, 
thouf^  not  conversely. 


Under  the  circumstances,  it  may  be  reasonable  to  assume  the  following 
linear  model  for  the  covariable: 


y  .  =  u  +  ?  +  € . 

^2  2j  213 


(1.2) 


th 

where  P_..  is  the  fixed  differential  effect  of  the  i  block  on  y^..,  and 

2i]  •’  •'2ij' 

€  is  the  random  model  error.  For  consistency  of  notation,  model (1.1) 
is  rewritten  as 


+  T.  +  P  +  ay  +  €,... 
1  Ij  213  I13 


When  there  is  doubt  regarding  the  independence  of  treatment  and 
covariable,  then  a  treatment  effect,  say  should  be  included  in  the 
model  for  the  covariable;  i.  e. , 


T_.  +  P-. 

2i  2j 


+  € 


2ij 


so  that  model  (1.  1)  is  now  rewritten  as 


(1.3) 


y,..  »  n,  +  T  +  p  +  ay  +  e 

''lij  1  li  li  ''2ij  lii 


2ij  lij* 


(1.4) 


Substituting  the  expression  for  y...  in  (1.  3)  into  (1.4)  yields 

£1} 


yiii  -  ^ 


where  is  the  overall  treatment  effect  on  y^,  is  the  direct 

treatment  e^ect  on  y^,  and  is  the  indirect  treatment  effect  on  y^ 
or  that  treatment  effect  which  is  passed  on  to  y^  throuf^  y^. 
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^1^2 


In  Sections  2  and  3,  we  will  illustrate  the  following.  Assume 

0  =0.  Then 

*1^2 

<1)  if  treatments  and  covariable  are  independent  and  if 

model  (1.2)  is  adequate  for  the  covariable,  the  treatment 
contrast  precision  on  the  variable  of  interest  (y^)  is  in¬ 
creased,  not  only  asymptotically,  but  in  the  finite  sample; 

(2)  if  treatments  affect  the  covariable  and  if  model  (1.3)  is 
adequate  for  the  covariable,  then  that  estimated  treatment 
effect  on  the  variable  of  interest,  which  is  obtained  through 
the  usual  covariance  analysis,  is  the  direct  treatment  effect 
on  y^;  and  corresponding  treatment  contrasts  also  have 
increased  precision  in  the  finite  sample. 
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If  0  /  0,  which  is  considered  in  Section  4,  it  will  be  shown 

that  statement  (1)  still  holds.  However,  ol  ,  in  the  context  of  state¬ 
ment  (1),  and  direct  and  indirect  effects,  in  the  context  of  statement  (2), 
are  not  estimable  through  usual  techniques  due  to  underidentification. 
When  parameters  are  under  identified,  an  alternative  estimation  pro¬ 


cedure  is  discussed. 


2.  INCREASING  TREATMENT  CONTRAST  PRECISION 
WHEN  THE  COVARIABLE  IS  UNCONTROLLED. 
INDEPENDENT  OF  TREATMENTS.  AND 


2.  1  THE  STRUCTURAL  AND  REDUCED  SYSTEMS. 
AND  THE  ESTIMATION  OF  TREATMENT 
EFFECTS  WHEN  a  IS  KNOWN 


Consider  those  applications  where  the  covariable  is  uncontrolled, 
independent  of  treatments,  and  where  the  system  {described  in  Section  1) 


(2. 1.  1) 


(2.  1.2) 


(2.  1.3) 


i.e..  [  Cjij]  is  identically  and  independently  distributed  with 


expectation  (0.  0].  and  variance 


•  1 


0 


2 


Models  (2.  1,  1)  aiid  (2.  1.  2)  describe  a  partial  and  relevant  structui 
of  the  experimental  unit  and  are  thus  termed  the  structural  regression 
models  or  the  structural  system.  It  is  inconsistent,  logically  though  not 
necessarily  statistically,  to  estimate  the  system's  parameters  separate! 
or  model  by  model;  for  in  doing  so,  is  fixed  in  (2.  1.  1)  and  uncon¬ 
trolled  in  (2.  1.  2);  certainly  y^  cannot  assume  both  roles  simultaneously 
Moreover,  if  instead  of  (2.  1.  2),  y„..  =  Uo  ‘’’o  '•’“'y  i 

separate  least  squares  estimation  leads  to  inconsistent  estimates  as  is 
discussed  Haavelmo  (1943). 

Substituting  (2. 1.  2)  into  (2. 1  1)  yields 

which,  along  with  (2.1.  2),  is  termed  tl^  reduced  system  or  the  reduced 
regression  models.  In  (2.1.4),  not  only  is  directly  estimable,  as 
opposed  to,  say,  the  direct  block  effect,  on  y^,  but  also,  in  the 
reduced  system,  y2  no  longer  assumes  a  dual  role.  Separate  estima¬ 
tion  in  this  reduced  system  yields  consistent  estimates;  however  the 
errors  of  the  reduced  system,  « ®*2ij  *****  *2ij*  correlated, 
which,  coiu:urrent  with  a  nonexistent  treatment  effect  in  (2.1. 2),  Implie 
an  iiK^reased  treatment  contrast  precision  through  joint  estimfition 
(Mallios  (1961)].  as  will  now  be  shown. 
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From  (2.  1.  3)  it  follows  that 


L^iij  ^^2iy  "2ij 


:  i.  i.  d. 


(0,0),  X  ] 


(2.  1.5) 


where 


1  = 


2  ^  ^2  2  2 

c  +  Qt  0  ,  aa 

h  ^2  ^2 


2  2 
CK  0  ^  0 


^°hh'^ 


(2.  1.6) 


h,  h'  -  1,  2.  Denote 


z 


-1 


hh' 

by  (o  )  and  write  the  sample  form  of  (2. 1. 4) 


and  (2.  1,  2)  as  y, .  =  m  +  t.  +  b.  +  e,,,  and  y...  =  m.  +  b„.  +  e...,  re- 

*^lij  1  J  lij  *^213  2  23  213 

spectively,  where  m,  t^,  b^  and  correspond  to  +0^2, 

p.. +Q{^  and  €,..+a€„.,. 

13  23  I13  2ij 

For  a  known,  best  (Markoff)  estimates,  among  the  class  of  linear, 
unbiased  estimators,  are  found  by  taking  partials  of 


2,  2.  q,  r 

z 

h,  h',  i,  j 


e.  ..  e,  ...  0 
hi3  h'13 


hh' 


(2.  1.7) 


equating  these  to  zero,  and  utilizing  the  usual  restrictions  that 


?'■  ■  ? 


0. 


(2.  1.8) 


The  resultant  estimates  are 


t.  =  (Jii-yj)  -  «(y2i-y2>-  “^23  =  ^23  -^2  * 


*>j  =  ^13  ■  yj'  "*2 '  ^2.  "  yr 


where 


“  '  ‘’i2/‘’22’  yw  '  I  yhij^*'-  yfaj "  I  yiuj''’- 


^  Z  ywj'’''- 


1/2 

Also,  with  Oj2  ■  P<0ii022^  in  (2.  1.6), 


(t.  -  t.,)  =  2ojj(l-p^)/r  =  2o^^Jt, 


(2.  1.  10) 


If  T-.  is  included  in  model  (2.  1.2)  when,  in  fact,  t  =  0,  then  t. 

2i  2i  1 


is  estimated  by 


t|°’  =  (yii-yj)  '  ■'i  +  '„  =  -*i  +  “<^21-V 
and  var  (t|°^  -  t|°^  =  2  1 
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2.  2  THE  ESTIMATION  OF  TREATMENT 
EFFECTS  WHEN  0£  IS  UNKNOWN 


For  2^  and  functions  thereof  unknown,  various  estimates  of 
have  been  considered  [Mallios  (1961),  Zellner  (i962)].  though  in  a 
slightly  different  context.  Perhaps  the  most  obvious  estimate  is  that 
which  maximizes  the  likelihood  function,  assuming  normality  in  (2.1.3). 
This  estimate  requires  iteration  to  convergence.  Since  in  (2.  1.9)  0£ 
is  unknown,  olplythe-t.  change  in  iteration  which  implies  that  tlie  un¬ 
biased  estimator 

°22  °  E  ^2ij'  ”2  ■ 

ij 

remains  unchanged  in  iteration,  where  (qr  -  r)  is  the  maximum 

likelihood  estimate  of  o  _.  Choosing  an  initial  estimate  of  t.  as 
tj*^!  in  (2.  1. 11)  and  the  other  estimates  as  in  (2.  1.  9),  then  the  estimators 

“u  “  Z  ■  “2  ■  -  1  -  *•  +  *> 

(2.2.2) 

and 

^  (yj.j  •’  m  -  tj'^^  -  b^)^/(qr  -  q  -  r -f  1)^  (2.2.3) 

ij 

are  consistent  and  unbiased  for  Ojg  Ojj.  From  Oj^Vo*2  "  •  s 

second  estimate  of  say  t|^^  *  (y^^  "  Fj)  "  obtained 


from  which  ^^^2  ‘  *^11^  are  calculated.  This 

process  is  continued  until  stable  estimates,  say  t.*, 

1  1 A  M  iw 

and  0  *j,  are  attained.  The  efficiency  of  the  maximum  likelihood 

2  2 

estimator,  -  t?,,  i  ^  i',  is  given  by  (2.  1.  10)  and  0  =  0  -  a  o__ 

11  C  I  1 

2  2,  ^ 

estimated  by  a*  =  c*  -  of„/o*_. 

11  12  22 

Aside  from  t*  -  t*  ,  all  estimates  obtained  during  the  iterative 
cycle  (except  for  t|*^^  -  t|°^  are  consistent  and  equally  efficient;  i.  e. , 

since  °^ii‘  ^*^^*^22  ~  °^22  the  initial  estimate, 

t!^^  -t|,^\  is  therefore  consistent  and  efficient  [Zellner  (1962)]  ;  and, 

since  the  initial  estimate  and  the  "final"  estimate,  t*  -  t*,  share  the 

same  large  sample  properties,  so  must  all  estimates  obtained  during 

the  iterative  cycle.  While  the  exact  finite  sample  variation  of  these 

estimates  remains  unresolved,  Zellner  (1964)  in  a  somewhat  different 

context,  produced  yet  another  estimator  and  derived  its  finite 

sample  properties.  His  approach  is  as  follows:  Assuming  normality 

in  (2.  1.  3)  and  including  a  treatment  effect,  say  r  =0,  in  (2.  1.  2),  then 

it  is  seen  that  is  estimated  by  t..  =  y-.  -  v-  where  E(t-.)  =  0;  t.  is 
2i  *'  2i  *'2i  2  2i  1 

then  estimated  by  t|^^  in  (2. 1.11);  and  the  consistent,  unbiased  variance' 

,  (o) 

covariance  estimates  s^j  "  <^11 


13 
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and 


s 


22 


Z  *^2ij  ■  ”'2  ■  *2i  " -  q  -  »■  +  1)  (2.2.5) 


IJ 


are  distributed  as  Wishart  variates  independently  of  y^^  -  y^  and 
^2i  "  ^2*  ‘’’i  reestimated  by 


Tj  -  (yii-yi>  -  (®12/®22>  *^21  ■  y2>' 


(2.2.6) 


A  A 

then,  since  s,-  and  S--  are  consistent,  the  efficiency  of  t.  -  t  is 

1.  a  aa  i  i' 


A 


A 


given  by  (2. 1.  10);  moreover,  is  unbiased;  i.  e 


E(?j)  =  E(y,.  -yj)  -  ECy^j  -y^)  =  t. 


since  ®J2^®22  distributed  independently  of  yg^  -  y^,  whose  expec¬ 
tation  is  zero.  Zellner  then  derived  the  exact  second  moment  of 

A  A 

Tf  -  T., ,  given  by 


var(T^-T.,)  =  (2/r)0jj(l-  p^)(l  +  0)  =  (2/r)oJ  (l  +  ^i)  (2.2.7) 

where  ♦  =  (N  -  2)’^  r  [  (N  +  l)/2]/r(N/2)  and  N  =  qr  -  q  -  r  + 1, 
and  showed  that  the  exact  distribution  of  rapidly  approaches 

normality,  for  N  >  10.  Note  that  when  a  =  0,  the  exact  second  moment 
of  a  treatment  contrast  is  2a^  /r;  comparing  the  latter  with  (2.  2.  7),  it 
is  seen  that  the  inclusion  of  covariable  in  (2. 1. 1),  when  in  fact  y^  is 


~  1 1  - 


independent  of  i,  e. ,  ot  -  0,  produces  a  decrease  in  contrast 
precision.  As  such,  it  is  assumed  throughout  that  |o(j  is  substantial. 


A  A  (1)  (1) 

In  comparing  the  estimators  and  t?  "  '  there  is 

reason  to  prefer  the  latter  even  though  its  finite  sample  variation  is, 

at  present,  unknown.  Note  first  that  var  <  var  s. .  ,  where 

hh'  “  nh'  1 1 

is  given  in  (2.  2.  3);  o  is  given  in  (2.  2.  2);  0^2^=  a*^  is  given  in 


(2.  2.  1);  Sjj  =  ®12  given  in  <2.2.  4);  and  s^g  is  given  i 

(2.  2.  5);  i.  e. , 


m 


var  0 


(o)  _ 


11 


=  2o 


jj/(qr  “  q  -  r  +  1)  =  var  s 


11' 


var  0^2^  =  2a^^^/{qr  -  q  -  r  +  1)  =  var  Sjg. 
but 

var  022^  =  2o22/<qr  -  r)  <  2a^^l{qr  -  q  -  r  +  1)  «  var  8^^. 

Since  -  t^,  =  function  <S|2^®22^  “  function 

it  is  likely  that  ^  has  greater  precision  in  finite  samples.  Thus, 

in  actual  application,  it  is  advisable  to  present  both  estimates. 

2.3  TESTS  OF  SIGNIFICANCE,  AND  THE  ANALOGY 
WITH  THE  USUAL  COVARIANCE  METHOD 

In  using  in  estimating  t^,  the  obvious  estimate  of 

’ll  ‘  "11  ■  '’l2''’22  •**  *  *11  ■  •l2'*22-  Ho**''*'- 
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found  from  a  nonlinear  combination  of  the  s, ,  ,,  is  biased;  makinpf  the 

hn' 

2  2  2 

adjustment  (qr  -  q  -  r  +  1)  s*  /(qr  -  q  -  r)  =  s  ,  s  is  unbiased 
2 

for  0^  .  Then  the  null  hypothesis  =  .  .  -  =  0  is  rejected  if 


(qr^r-lji^  -(qr-q-r)s^ 

e  j 


fJy/(q-l)sf^>F„ 


(q  -  1,  qr  -q-r) 


(2.3.1) 


where 


[ 


ij 


(y , -  m  -  b,)(y,.,  -  m ^  -  b,,) 


lij 


'j''^2ij 


2j^ 
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is  the  estimate  of  (qr  -  r  -  1)  o  with  t.  *  0,  and  F  (q  -  1,  qr  -  q  -  r) 

C|  1  cx 

is  the  upper  0(  critical  value  of  the  F  distribution  with  q  -  1  and 


qr  -  q  -  r  degrees  of  freedom.  Note  that  if  ♦  in  (2.  2.  7)  is  ne^^igible 
and  Tj  -  T^,  is  sufficiently  normal,  then  tlie  test  in  (2.  3.  1)  is  nearly 
exact  rather  than  asymptotic. 


In  the  usual  covariance  analysis  [Anderson  and  Bancroft  (1952)] , 
the  estimate  of  is  identically  in  (2. 2. 6),  while  the  adjusted  treat¬ 
ment  mean  square  is  tested  by  (2. 3. 1).  However,  assuming  the  co¬ 
variable  fixed,  the  treatment  contrast  precision  is 
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(2.3.2) 


r  +  (y 


21 


in  the  limit  and  under  (2. 1.2),  (2.  3.  2)  reduces  to  (2. 1.  10),  for  then, 
the  variance  of  the  estimated  slope  is  known  and  -  ygj^j  becomes 
E(y2i)  "  ~  uncontrolled  covariables,  (2.3.2)  is  an 

asymptotic  result  and  is  therefore  comparable  to  (2. 1.  10),  not  to  the 
finite  sample  result  in  (2.  2.  7).  Hence,  imder  (2. 1.  2)  there  results 
not  only  an  increase  in  relative  efficiency,  but  also,  if  ♦  is  ne^igible, 
this  increase  applies  to  finite  samples. 


Herein,  one  result  is  that  is  estimated  by 


A 

T. 

1 


^  ‘  ^®12^®22^  ^2i 


whether  model  (2. 1. 1)  is  considered  adone  (the  usual  covariance 
analysis)  or  whether  the  entire  system  in  (2. 1. 1)  and  (2. 1.  2)  is 
utilized.  In  the  former  case,  the  covariable  is  assumed  fixed,  and 
the  estimated  treatment  effects  are  adjusted  for  differences  in  the 

m 

between  experimental  units.  When  the  entire  system  of  (2. 1. 1)  and 
(2. 1. 2)  is  allied,  the  adjustment  Is  somewhat  different.  Under  (2. 1. 2) 
the  covariable  is  independent  of  treatments.  However,  y-.  -  y^,  the 
estimate  of  is  non>zero  (with  probability  equal  to  1)  so  that 
hi  -  h  becomes  an  estimable  within  sample  bias.  Since  the  errors 
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of  the  reduced  system  in  (2.  1. 4)  and  (2.  1.2)  are  correlated  (when 
®  ^  0),  Vo.  -  Vft  should  be  accounted  for  in  the  estimation  of  t.. 

Thus  -  y^  is  adjusted  by  an  amount  produce  the 

proper  estimate  of 

Example  2.  1.  Williams  (1961,  p.  119)  describes  an  experiment 
on  the  effect  of  temperature  on  the  maximum  compressive  strength  of 
timber  specimen.  Material  from  ten  trees  was  taken  and  a  specimen 
from  each  tree  was  tested  at  each  of  five  temperatures.  The  moisture 
content  of  each  specimen  was  uncontrolled  so  that  a  covariance  adjust¬ 
ment  was  made  to  the  data.  Williams'  detailed  analysis  illustrates,  in 
part,  that  the  residual  variability  was  reduced  substantially  throu^  the 
introduction  of  the  covariable  and  that  temperature  effects  were  hi^y 
si^ificant. 

If  the  trees  (or  blocks)  in  this  example  were  fixed,  then  the 
structural  system  in  (2. 1. 1)  and  (2. 1.  2)  applies  since  moisture  con¬ 
tent  is  independent  of  ten^erature  (moisture  content  is  measured 
prior  to  the  i^|>plication  of  tenq>erature)  and  follows  the  model  in  (2.  i.  2). 
However,  the  fact  that  trees  are  random  will  be  disregarded  in  the 
same  manner  that  least  squares  estimates  are  utilised  in  mixed  and 
random  models.  As  such,  the  treatment  contrast  precision  in  this 
example  is.  from  (2. 2. 7). 


-i5- 


where 


(2/r)  ?  (1+0  -  (1/5)  0^  (1  +0) 
"l  "l 


0  :=  34'^  r(18. 5)/  r(18)  =  0.069. 


Since  ♦  is  negligible,  the  treatment  contrast  precision  becomes 
2 

(1/  5)  0  rather  than  the  usual  result  given  by  (2.  3.  2). 

^1 


2.4  ON  INCREASING  TREATMENT  CONTRAST 
PRECISION  IN  BALANCED  INCOMPLETE 
BLOCK  DESIGNS 


Consider  the  incomplete  block  design  model 


y...  =  A.,  (^i  +  '«'.  +  P.  +  €  ) 

1  3  ij 


(2.4.1) 


where  A^^  =  0  or  1;  i  =  1.  .  .  . ,  q;  j  =  1.  .  .  . ,  r.  Let  u  denote  the  num¬ 
ber  of  times  a  treatment  is  replicated;  let  v  denote  the  number  of 
plots  per  block,  and  let  every  treatment  appear  with  every  other  treat¬ 
ment  in  the  same  block  an  equal  number  of  times,  say  w.  Thus,  we 
have  a  balanced  incomplete  block  (BIB)  design  and  is  estimated  by 


‘1 "  I  ■  1  % 


C.  ij  U 
3  i 


(3.4.2) 


Denoting  the  variance  of  the  model  error  in  (2. 4. 1)  by  the  treat¬ 
ment  contrast  precision  is 
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var  (t:  -  t:,)  •  2 


(2.4.3) 


where  the  efficiency  factor  -  (u(v  -  1)  +  w]/uv  and  E^  <  1  [Anderson 

and  Bancroft  (1952)1  .  In  (2. 4.  1)  if  A..  -  1  for  all  (i,  i),  then  we  have  a 

ij 

randomized  block  (RB)  design  and  the  contrast  precision  is 

2  a^^lT  <  2  0,,/rEj. 

However,  if  blocks  become  heterogenous  when  the  number  of  plots  per 
block  equals  the  number  of  treatments,  then  a  comparison  of  the  con¬ 
trast  precisions  between  the  two  designs  is  misleading;  i.  e. ,  assuming 
heterogeneity  within  blocks,  the  estimated  for  the  RB  design  becomes 
larger  than  the  estimated  for  the  RIB  design. 

It  wUl  now  be  shown  that  it  is  possible  to  utilize  the  BIB  design 
(described  by  model  (2.  4.1))  and  at  the  same  time  to  achieve  a  treatment 
contrast  precision  which  is  nearly  identical  (if  not  greater  than)  the 
2  o^^/r  value  for  the  RB  design. 

In  all  the  design  of  the  experiment,  another  response,  say  y2* 
must  be  identified,  where  y^  is  independent  of  treatments  and  is  highly 
correlated  with  y^.  Then,  if  y^  follows  the  model 

^2ij  %  ^^2  ^2J  ^  *^2il^  (2,4.4) 
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while 


y,..  =  A..  (>i  +  T.  g..  4-ay  +  c  ),  <2.4.5) 

•'lij  ij  1  1  ij  *^213  2ij 


we  may  apply  the  results  of  the  previous  sections.  The  reduced  system 


corresponding  to  (2.  4.  5)  and  (2.  4.  4)  is 


y...  =  A.,  ftn  4-  Ota)  4-  T.  4-  (P,.4-0(B  .)  4-(C  4-0£€  )]  (2.4.6) 

•'lij  ij  1  2  1  Ij  2j  lij  2ij 

and  (2. 4. 4).  Letting  =  u  4- ai*  p  =  3  4-0$  ,  and  c,.  =  €  +ae 

®  1  2  j  Ij  2j  I13  2ij 

it  is  seen  that  the  model  in  (2.  4.  6)  is  identical  to  the  model  in  (2. 4. 1). 

Let  var  4-  a*2ij’  ^2ij^  '^°hh'^  considering  the 

entire  reduced  system,  the  Markoff  estimate  of  is 


‘i  '  1  ■  I  Z 


■  [Z^2io'‘'  ■  Z  Z 


var(t^-y  .  (2/r)(Ojj  -0j2/022>^Ej 


(2.4.7) 
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The  parameter  ®^y  ®i2^®22’ 

precisely  as  in  Section  2.2;  i.  e. ,  including  a  treatment  effect,  say 
T2^  =  0,  in  (2.  4.  4),  then  the  residual  squares  and  cross  products  yield 

A 

the  estimates  s_-  and  s,„.  Thus  t.  is  estimated  by  t.  where 

22  12  j.  1 


A 

T. 


'i  =  I  -1^1 


./rv 


-  Z^ij  Z  W'] 


(2.4,8) 


A  A 

and  the  exact  second  moment  of  t.  -  t.,  is 

1  i' 


A  A 


var  (t.  -  T  )  =  (2o  /r)(l  -  p  )(1  +  ♦)/£ 

1  i'  11  f 


(2.4.9) 


where 


<•  =  (qr  -  q  -  r  -  1;  r  [(qr  -  q  -  r  +  2)/2]  ^  r[(qr  -  q  -  r  +  1/2] 


is  the  finite  sample  correction  factor  as  in  (2.  2.  7).  Note  that  if  ^  is 

2 

ne^igible,  then  (2.4.9)  reduces  to  (2.4.7).  Thus,  if  p  is  sufficiently 

2 

large,  to  the  extent  that  (1  -  p  )  and  cancel,  then  we  have  (2/r)  o^j, 
the  contrast  precision  for  a  RB  design. 
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3.  TH£  QUESTION  OF  DIRECT  AND  INDIRECT 
EFFECTS  WHEN  THE  TREATMENTS 
AFFECT  THE  COVARIABLE  AND 

0  €  €  =  0 
1  2 


3.  1  THE  ESTIMATION  OF  DIRECT  AND  INDIRECT 
EFFECTS  WHEN  ALL  TREATMENTS  HAVE  A 
POSSIBLE  EFFECT  ON  THE  COVARIABLE 


Consider  the  structural  system  (described  in  Section  1) 


y 


lij 


+  T,,  +  P,,  +ay 


li 


Ij 


2i3 


(3.1.1) 


^  “2  ^2i  ^2)  *  '2ij. 


(3.1.2) 


The  reduced  models  are  (3. 1. 2)  and 


Let  the  sample  form  of  (3.1.2)  and  (3. 1 . 3)  be  written  as 


y...  =  m  +  t,  +  b.  +  e... 
’^113  1  1  li3 


3  lij 


(3.1.4) 


and 


^213  =  ”^2  ^  '2i  ^  ^23  ^  ®2i3 


(3.1.5) 


-20- 


where  m,  t.,  b.  and  e,..  correspond  to  u,  +au_,  t 

1  j  lij  1  2  U  2i 

and  ^®spectively.  The  assumptions  regard¬ 

ing  the  e ’s  in  (3.  1.  1)  and  (3.  1. 2)  are  identical  to  those  in  (2.  1. 3). 

Adding  the  restriction  that  ^  =  0  to  those  in  (2.  1. 8),  the  Markoff 

i 

estimates  become 


m  =  y 


1  ' 


“2  "  ^2' 


’'2i  '•^2i 

.  w  2  .  _ 


-y,). 

!l 

1 

*>23  =  6^23  -  ^2*  ■ 

)  and  var  (t«.  -  t„.,)  =  2o^ 
2i  2i'  e 

(3.1.6) 


I 


with  var(t.  -  t.,)  =  (2/rMo  +a  o  )  and  var(t«.-  t„.j  =  2o  /  r. 

1  1*  €  j  Cg  2i  2i'  Cg 

Note  that  the  estimates  in  (3. 1.6)  do  not  contain  elements  of  the  un¬ 
known  a  convenience  which  results  when  the  design  matrices  of 
two  or  more  linear,  reduced  models,  with  correlated  errors,  are 
identical  [Mallios  (1961)]. 

Since  treatments  may  have  direct  and  indirect  effects  on  y^,  the 
parameters  of  the  structural  system  need  be  estimated,  is  esti¬ 
mated  directly  by  tg^;  equating  t^  to  '’"jj  +  ®’’2i  ^2i 

for  there  results  q-1  equations  in  q  unknowns.  But  from  the 

estimate  of  ^  ,  given  by  the  s^,  of  Section 2.2,  we  have  three  equations, 

2  2 

s,,,  s,.,  Sn.,  in  three  unknowns  o  ,  a  and  a.  From  the  estimate  ofoi, 
11  12  22  ^1  ^2 
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given  by  a  =  ®j2^®22'  estimate  the  direct  treatment  effect 

A 

on  y,,  by  t,.  =  t.  -  at_..  But  t,.  is  precisely  t  .  in  (2.2.6),  so  that  the 
•'I  li  1  2i  li  1 

usual  covariance  technique  provides  proper  tests  and  estimates  of  the 
direct  treatment  effect  on  y^,  under  the  structural  system  in  (3.  1.  1) 
and  (3.1. 2);  moreover,  var  (t^^  -  t^.,)  is  given  in  (2.2.7). 

Thus,  if  the  are  significant  while  the  t^^  =  t  ^  are  nonsignificant, 
then  the  treatment  effect  on  y^  is  indirect;  i.  e. ,  the  treatment  affects  y^ 
through  the  covariable  and  not  directly.  If  both  the  t^  and  tg.  are  signif¬ 
icant,  then  there  are  both  direct  and  indirect  treatment  effects  on  y^ . 

Example  3.1.  Anderson  and  Bancroft  (1952,  p.  302)  discuss  an 
experiment  on  the  effect  of  fertilizer  levels  on  the  yield  of  sugar  beets. 
The  covariable  is  stand,  which  may  be  influenced  by  fertilizer  though 
not  by  yield;  and  the  field  is  divided  into  six  homogeneous  blocks .  The 
analysis  shews  that  treatment  effects  on  yield,  adjusted  for  stand,  are 
not  significant,  while  treatment  effects  on  stand  are  highly  significant. 
Disregarding  the  model  in  (3. 1. 2).  the  experiment  certainly  falls  into 
the  "uncertain  class".  However,  stand  is  adequately  predicted  by  blocks 
and  treatments,  as  is  hypothesized  by  model  (3.1.2).  Thus  the  analysis 
implies  that  there  is  an  indirect  rather  than  a  direct  treatment  ei^ect  on 
yield;  i.  e. ,  the  treatment  effect  on  yield  is  through  the  stand.  Though 
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there  is  little  need  to  consider  treatment  contrast  precision  for  direct 
effects  on  yield,  the  value  of  ♦  is  given  by 

(28)'^  r  r  (15.  5)/  r  (15)  =  0,  077. 

Treatment  contrast  precision  for  direct  treatment  effect  on  stand  is 
■”<V  6  whose  estimate  is  318.5. 

3.2  AN  ILLUSTRATION  WHEREIN  THE  TREATMENTS 
DEFINE  A  FACTORIAL  EXPERIMENT  AND  THE 
COVARIABLE  IS  INDEPENDENT  OF  ONE  FACTOR 

Scheffe  (1959,  p.  217)  presents  data  from  an  experimental  piggery 
where  six  young  pigs,  three  male  and  three  female,  were  allotted. to 
each  of  five  pens.  Three  amounts  of  protein,  say,  fj,  and  f^, 
in  increasing  proportion,  were  given  to  one  male  and  one  female  in 
each  pen.  The  pigs  were  weighed  individually  each  week  for  16  weeks, 
and  the  growth  rate  (y^)  in  pounds  per  week  was  calculated  for  each  pig. 
The  weight  (y^)  of  each  pig  at  the  beginning  of  the  experiment  is  the 
measured  covariable. 

For  purposes  of  adding  to  homogeneity  within  pens,  the  allotment 
of  pigs  to  pens  is  by  initial  weight.  Within  each  of  the  two  groups  of 
15  male  and  15  female  pigs,  the  pigs  are  ordered  from  highest  to  lowest 
according  to  initial  weight;  i.e. , 


•  •  « 


are  the  15  male  weights 


>  W 


>  w 


M,  15 


and 


W  >  W  >  ...  >  W  _  are  the  15  female  weights. 

jpj—  r  i  ""  r,iO 

The  three  heaviest  males  and  the  three  heaviest  females  are  assigned 
to  the  first  pen,  the  next  three  males  and  the  next  three  females  are 
assigned  to  the  second  pen,  etc. 

* 

Figure  3.  1  presents  a  path  diagram  relating  sex,  protein,  order¬ 
ing,  and  pen  to  growth  rate  and  initial  weight.  The  arrow  from  to  y 
describes  a  possible  effect  of  initial  weight  on  growth  rate.  Sex  may 
have  a  direct  effect  on  y^  and  y^,  and  hence  an  indirect  effect  on  y^. 
Also,  protein  may  effect  y^,  though  certainly  not  yg,  and  pen  may 
have  an  effect  on  y^. 

Figure  3. 1 

A  Path  Diagram  Relating  Sex,  Protein,  Ordering, 
and  Pen  to  Growth  Rate  (y^ )  and  Initial  Weight  (yg) 


Sex 


Protein 


Pen 


y^i—  Ordering 


The  five  ordered  groups  will,  very  likely,  yield  a  significant 
source  of  variation  for  y  .  These  groups  can  be  looked  upon  as  five 

A 

fixed  blocking  effects  on  y.  in  the  same  way  that  pens  are  blocking 

A 

effects  on  y^ .  However,  it  must  be  assumed  that  the  30  selected  pigs 
are  representative  of  the  population  of  pigs  with  respect  to  initial 
weight. 

From  Figure  3.1,  the  following  linear  structural  models,  in 
sample  form,  are  hypothesized: 

^lijk  *  “'i  ®li  *  ®y2ijk  ^lijk 


"^2  ”*■  ®2i  ^2k  ®2ijk 


(3.2.2) 


where  s^  :  sex  i  »  1,  2;  f.  : 

P*"-  “  ■  * . *•  '’2k= 

models  become  (3.  2.  2)  and 


protein  level,  j  ■  1,  2,  3; 
ordering,  k»  1,  2, .  .  .  ,  5. 


b 


Ik  * 

The  reduced 


yiijk  *  ‘“i  *  •“2*  *  ‘*11  *  "21*  * 


+  (bj^  +  +  (e,y^  + 


(22.3) 


“  *  *1  *  V  ‘••'’y  *  ‘’k  ^  'yk- 


-33  • 


under  the  restrictions  that 


I  Ih-  I  '  Z  V  = 


3 


1] 


The  assumptions  regarding  ®€2ijk  ^2ijk  ' 

errors  corresponding  to  e^^j^  and  ®2ijk’ 


*1,2.  (3.2.4) 

the  population 


where 


)  is  given  in  (2. 1.  6). 


AnUogous  to  the  derivation  of  in  (2. 2. 6).  the  fixed  effect.  «-e 
estimated  as  follows.  Include  a  protein  effect,  say  f^j.  and  a  protein 
by  sex  interaction  effect,  say  (s„f.,).. .  in  (3.  2.  2).  i.  e. , 

«  «  ij 


^2ijk  “  “2  ®2i  ^2j  *  ^®2^2^ij  **2k  *  ®2ijk 


where  E(f2^)  «  EfSgfj)^  ■  0.  Then  a  *  •l2^®22’  where  the 
estimators  are  found  from  the  sum  of  residual  squares  and  cross 
products;  e.  g. . 


•12  “  Z 

ijk 


•26- 


es  u  *•  Ahh'  .  hh'  . 

Substituting  s  for  a  m 


I 

ijk 


,  2  11  ^  12  ^  2  22, 

<*ijk  °  ®ijk  ®2ijk  "  ^  ®2ijk  ^ 


Ahh 


and  holding  the  s  fixed  in  differentiation,  then,  under  (3.2.  4).  the 
estimated  fixed  effects  are 


iHj  =  yi  -  ay^  . 


*li  •  -  *<^21  -  y2>  ' 


'j  •  ^ij  -  yi>  -  ‘(yaj  ■  ^2* 


'  ^Uj  ■  ?!>  ■  *^21j  -  y2»  -  *1  ■  'l 


••ik  •  *  '^2k  -  ^2* 


3,.y„-y,.  •2i*y„-yi 


(3.2.5) 


(3.2.6) 


b^-  bj^  .  -  Jj)  . 


The  elflcieiiey  of  s^  *  *||i  ^IS  while  its  exact  second  moment 

is  (3 0^  / where  ^  is  given  in  (2. 2. 7).  Similarly,  the  effi- 

2  / 

ciency  of  t  *  f.,  is  3e  /  1C  and  its  exact  second  moment  is 
.  .  *  ^  1/ 


lOKl  >  #). 
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Note  that  the  estimators  in  (3. 2.5)  are  precisely  the  estimates 
obtained  through  the  usual  covariances  method  which  considers  only  the 
model  in  (3. 2. 1).  By  introducing  (3. 2.  2),  there  results,  in  estimation, 
a  subtraction  of  ^  expression  for  s^^,  which  yields  the 

direct  sex  effect  of  y^ .  The  subtraction  of  comparable  terms  in  the 
expressions  for  f.  and  (sJ)..  adjusts  for  within  sample  bias,  and 
serves  to  increase  contrast  precision  (as  is  discussed  in  Section  2.3). 

The  analysis,  given  in  Table  3. 1,  illustrates  the  consequences  of 
three  approaches  to  this  problem:  (i)  a  model  is  hypothesised  for  the 
covariable,  though  it  includes  f^^  design  matrices 

of  the  reduced  models  are  identical  and  separate  estimation  is  identical 
to  joint  estimation:  (ii)  only  the  model  in  (3. 2. 1)  is  considered;  (iii)  the 
structural  system  of  (3.2. 1)  and  (3.2.  2)  is  hypothesised. 

Under  (i),  the  results  are  decidely  conservative.  Tests  of  effects, 
for  each  model  separately,  are  shown  in  the  unadjusted  mean  square 
columns  of  Table  3.1.  Here,  neither  treatments  nor  pens  ba\w  a  signifi* 
cant  effect  on  growth  rate,  while  ordering  has  a  highly  significant  effect 
on  initial  weight.  Under  approach  (ii),  the  usual  covariance  method, 
the  adjusted  treatment  mean  square  for  growth  rate  is  significant  at  the 
five  percent  level,  and  this  is  due  to  protein  levels.  The  estimated 
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slope  a  =  0.  088,  and  the  estimated  variance  of  a  is  (0.  253)(442.  933) 

Finally,  under  <iii),  statements  regarding  treatment  effects  on  are 

identical  to  those  under  approach  (ii).  In  addition,  sex  has  a  ne^igible 

effect,  not  only  on  growth  rate,  but  on  initial  weight.  And  from  the 

unadjusted  mean  square  column  for  y^,  it  is  seen  that  the  model  for 

initial  weij^t,  given  In  (3.  2.2),  is  adequate  with  s^^^  =  0.  Protein 

contrast  precision  is  estimated  by  2s^  /l5  =  0.034;  however,  N  =  20 
-1  -1/2 

and  #  =  (18)  it  r(10.  S)/r  (10)  >  0. 097  is  ne^igible,  so  that  the 
latter  precision  and  the  tests  of  significance  apply  to  finite  samples. 

In  Section  2. 2  four  treatment  effect  estimates  were  discussed: 
t^^\  the  unat^usted,  unbiased  conservation  estimate;  t^^^,  the  efficient 
estimate  based  on  one  cycle  of  iteration;  the  usual  covariance 
estimate;  and  t’*'.  the  maximum  likelihood  estimate.  Table  3. 2  pre¬ 
sents  the  three  protein  means  based  on  these  four  estimates,  i.  e. , 

+  Jj .  f^'^  ^  Xj  correspond  to  t^®^  t  , 

and  t^.  respectively. 


iM 


Table  3 ,  1 

Analysis  of  Covariance  of  Growth  Rate  (y^)  and  Initial  Weight  {y^) 
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4.  THE  ESTIMATION  OF  STRUCITURAL 
PARAMETERS  WHEN 
0  ^  0 


4,  1  PRELIMINARY  REMARKS 


In  Section  2.  2,  the  reduced  system 


yiij  '  ^  "i  ^  *“^21>  ^  ‘"ill 


Ij  -2]'  lij  2ij' 


y2lj  =  >'2  ^  ^2j>  ^2« 


which  corresponds  to  the  structural  system  in  (2.  1.  1)  and  (2.  1. 
considered  under  the  assumption  that 


var 


+  a£_.. 
lij  2ij 


€«• 

2iJ 


^22  2 

+  a  a  ,  oca 

I  ^2  ^2 

2  2 

a  a  ,  a 

^2  ^2 


If  0  ^0,  then 


^1^2 


var 


” 

2  „  2  2  2 

'ly  *  “'2y 

0  +2oi!o  +ao  *0  +ao 

^1  ^1^2  ^2  ^1®2  ®2 

• 

^2  2 

'2y 

0  +  CKo  f  0 

^1^2  ®2  ^2 

(4.1.1) 

(4.1.2) 

2)  was 

(4.1.3) 

(4.1.4) 
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and  the  treatment  contrast  precision  is  still  increased;  i  -  e. , 


^  =  yii  -  y,  -  <«i2/«22><y2i  ■  ^2’ 


and 


var  (t.  -  T.,)  =  (2  Ojj/r)(l  -  p^)(l  +  0) 


r)(l  +  ♦)  if  0  =  0  , 

^1^2 


where  is  that  estimate  of  ^22  which  finite  san4>le 


properties  are  available.  However, 


‘’12' ”22  '  ‘V2*“‘’'2’ 


/  ^2  h 


^  0 


®12^®22  ""  ®  °ej€2  *  ®' 


Moreover,  within  the  realm  of  the  data  and  assuming  no  further  prior 
knowledge,  a  cannot  be  estimated  by  existing  techniques  due  to  under- 
identification;  i.  e. ,  in  model  (4. 1. 1)  there  are  q  +  2r  unkiK>wn  param¬ 


eters  (^j,  ^2*  ®  '  Tj,  ...,  j,  Pjj, 


^1,  r-l'  ^21' 


^2,r-l^* 


and  in  the  covariance  matrix  of  (4. 1. 4)  there  are  an  additional  three 
2  2 

parameters  (0  ,  a  ,  and  0  ),  so  that  the  total  number  of  un- 

€  €  ^  ^  ^ 

12  12 

known  parameters  is  q  '>■  2r  +  3;  but  from  the  reduced  system  of 
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(4.  1. 1)  and  (4.  1.  2)  and  from  the  estimate  of  the  covariance  matrix  in 

(4.  1. 4),  there  are  only  q  +  2r  +  2  equations;  and  it  is  easily  seen  that 

2 

the  underidentified  parameters  are  a,  o  ,  and  o 

^1^2 

The  estimation  of  these  underidentified  parameters  may  be  of 
importance  for  the  following  reasons. 


(1)  Since 


and  since 


A 

T 

i 


0  /o 
12'  22 


function  (s 

X  Ck 

2 

(  0  +  Oo 

€ 


/s 

) 

2 


/ 


€ 


2 


2 

it  may  be  thato  (^0)  is  approximately  equal  to  cxo  0) 
€j€2  "  ^  €2 

but  opposite  in  sign.  In  this  case,  approximately 

zero,  and  from  the  sample,  one  might  be  led  to  the  mistaken 
conclusion  that  y^  is  independent  of  y^,  or  nearly  so. 

Thus,  if  a  resolvement  of  the  structure  of  the  experimental 
unit  is  the  issue,  then  existing  techniques  should  be  applied 
and  presented  with  caution. 


(2)  If  0  M,  then  there  is  the  very  basic  question,  "Why 

are  the  errors  of  the  structural  system  correlated?  "  It 

may  be  that  the  structural  models  are  inadequate  in  that 

other  important  variables  have  been  ne^ected,  in  which 

case  0,  ,  On  the  other  hand,  it  may  be  that  the 

®1®2 
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structural  models  are,  in  fact,  adequate  and  that  o  0 
is  due  to  the  extraneous  effects  of  the  infinity  of  variables 
■which  can  be  measured  from  an  experimental  unit. 

In  what  is  to  follow,  an  estimation  technique  is  discussed  whereby 
underidentified  parameters  are  estimable.  However,  no  pretense  is 
made  that  the  questions  posed  in  (1)  and  (2)  can,  at  present,  be  ade¬ 
quately  resolved  or  even  nearly  so.  All  that  is  done  is  to  suggest  an 
approach  which  may  be  of  some  value  in  the  actual  consideration  of 
(1)  and  (2). 

4.2  AN  EXTENSION  OF  THE  "INSTRUMENTAL 
VARIABLE  ESTIMATION  TECHNIQUE" 

Consider  the  functional  relationship  [Williams,  (1961,  Chapter  11)] 

n  7  5  (4.2.1) 

where  7  is  a  parameter  to  be  estimated,  and  t)  and  i  are  measured 
with  error  by 

y=n-€,  x=t+b.  (4.2.2) 

y»  7xH'(€-|'76)  is  obtained  by  substituting  the  e]q>ressions  in  (4. 2. 2) 
into  (4. 2.1).  In  a  sample  of.  say,  size  n,  the  least  squares  estimate 
of  7  may  be  inconsistent,  since  x  is  correlated  with  the  error  term 
€  't>7& .  As  such,  an  alternative  method  of  estimation  is  now  discussed. 
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Assume 


€. 

1 

B. 

_  1 

o  9  9  th 

i  =  1. .  .  . ,  n,  so  that  (c .  +  76 .)  :  i.  i.  d.  (0,  0  +7  o- ).  From  the  i 

11  Co 

ei^rimental  unit  from  which  the  responses  and  are  drawn,  let 

u.  denote  another  variable  which  will  be  termed  an  instrumental  vari- 
1 

able.  Assume  that  the  functional  relationship  in  (4.  2. 1)  is  invariant 
under  changes  in  extraneous  (instrumental)  variables  and  that  the 
measurement  errors  are  uncorrelated  with  the  instrumental  variable. 
Then  in  a  regression  of  *7^^  on  u^,  say, 

Ci  +  rSj  =  +  o^Uj  ♦ 

a  «  OL  «  0,  where  o  .  a, ,  a  are  intercept,  slope,  and  model  error, 
01  01 

respectively,  h)  the  8aii^>le,  the  estimate  of  is 

•i  •  Z'*i*  Z  “?• 

i  i 

assuming  u  «  0.  But  «  0,  and  equating  a^  to  zero  yields 

TOjtUj  ■  0  .  £  (jrj-r.j)uj 

i  i 
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whence 


i  i 

A 

is  a  consistent  estimate  of  7  .  /  is  termed  the  instrumental  variable 
estimate  of  7. 

We  will  now  show  wby  the  instrumental  variable  estimation  tech¬ 
nique  cannot,  in  its  present  context,  be  applied  to  the  estimation  of 
parameters  in  a  regression  relationsh4>  where  the  independent  variable 
is  uncontrolled  and  measured  with  error.  Consider  the  regression  model 

1  =  r  e  +  c*  (4.2.4) 

where  the  intercept  is  assumed  zero,  n  and  i  are  measured  with  error 
according  to  (4. 2. 2)  and  is  the  model  error.  Substituting  the  ex¬ 
pressions  in  <4.  2. 2)  into  (4.  2. 4),  we  have 

y  -  +  75  ♦  c"**)  .  (4.2.5) 

Again,  in  a  sample  of  size  n,  the  usual  least  squares  estimate  of  > 
may  be  inconsistent  since  x  and  5  are  correlated.  In  addition,  the 
error  term  e  ^  +  c  can  hardly  be  assumed  independent  of  extra¬ 

neous  variables  since  the  model  error  is  composed  of  variables  such 
as  u;  i.  e. ,  the  regression  relationship  is  not  invariant  under  changes  iii 
extraneous  or  instrumental  variaUes. 
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To  apply  instrumental  variable  estimation  to  a  regression  rela¬ 
tionship,  we  must  find  a  variable  u  such  that  t  +  is  inde¬ 

pendent  of  u.  Such  a  variable  u  is  available  from  two  sources,  a 
table  of  random  numbers  and  disassociated  experiments;  i.  e. 

€  +  76  4“  is  independent  of  u  if  the  u^  are  drawn  from  a  table  of 
random  numbers;  also,  if  the  y.  and  in  (4.  2.  5)  correspond  to,  say, 
a  biological  experiment  and  the  u^  are  responses  taken  from  an 
unrelated  industrial  experiment,  then  £ -t*  76  +  is  again  inde¬ 
pendent  of  u.  Consequently,  the  instrumental  variable  estimation 
technique  can  be  applied  to  a  regression  relationship  (where  the  in¬ 
dependent  variable  is  subject  to  measurement  error  and  is  uncontrolled) 
if  the  u^  are  properly  chosen. 


If  the  independent  variable, 
if  the  measurement  error  b  0, 

of  r,  ov  c  •  Y.  Vi  /Z  V 

i  I 


(,  in  (4.  2.  4)  is  controlled  and/or, 
then  the  usual  least  squares  estimate 
is  the  best  estimate  and 


(4.2.6) 


2  2  2  4- 

where  0  *  ^i  ''i  P**®****®®®  estimate 

of  7  as  given  in  (4.  2. 3),  where 

var  7  *  0*  X  “i  /  ^  Z  “i  V*  (4.2.7) 
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Comparing  the  result  in  (4.  2.  6)  with  that  in  (4.  2.  7),  then  we  have  the 
very  obvious  result  that 

A 

var  7  >  var  c  where  u  x 

=  var  c  where  u  =  x  ,  i.  e. , 

E  “V  *E  -  “V  E 

since 

The  latter  is  a  well  known  inequality. 

Thus,  we  have  shown  that  if  least  squares  estimation  (when  appli 
cable)  is  compared  to  instrumental  variable  estimation,  the  latter  pro* 
duces  conservative  estimates  but  has  broader  applicability. 

4.3  AN  EXAMPLE  OF  UNDERIDENTIFIED  PARAMETERS 
AND  THEIR  ESTIMATES  THRCXJGH  INSTRUMENTAL 
VARIABLE  ESTIMATION 

In  the  previous  section,  the  instrumental  variable  estimation 
technique  was  utilized  in  the  estimation  of  parameters  wl^n  there  exis' 
substantial  measurement  error,  in  this  section,  as  in  Sections  1«  2, 
and  3.  we  assume  that  measurement  errors  are  negIigU>le.  and  the 
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instrumental  variable  estimation  technique  is  applied  for  the  expressed 
purpose  of  estimating  underidentified  parameters. 

Consider  the  structural  system 


y 


li 


(4.3.1) 


+ 


(4.3.2) 


+  e 


2i 


(4.3.3) 


i  =  1,  .  .  .,  n,  where  and  are  population  means;  (4.  3.  3)  is  the 
sample  form  of  (4.  3.  2);  o:  is  the  rate  of  change  in  per  unit  change 
in  y-;  and  e,.  and  are  model  errors.  It  is  assumed  that 

^2  li  2i 


The  reduced  system  corresponding  to  (4.  3.  1)  and  (4.  3.  2)  is 


(4.3.4) 


u:  +  C. 

1 


m  +  e. 
1 


(4.3.5) 


and  (4.  3.  2),  and  the  covariance  matrix  of  the  reduced  model  errors, 

(e, .  +  c  ),  as  given  in  (4.  1.  4). 

li  2i  2i  ® 

The  Markoff  estimates  of  (p.^  +  ^^2’ 

2  2  2  2  2 
0  +  2  ^  +0^0,0  +0£c,  and  o  are,  respectively; 

^1  ^1^2  ^2  ^1^2  ^2  ^2 

Z  yii/"' Z  ^2i"'  Z'^ii'^i’ /'"■*’•  Z<yu-yi>'y2i-yi>/<’’'‘’- 

i  i  i  i 

and  2  <y2i-y2*^ 
i 

identified,  and  the  instrumental  variable  estimation  technique  is  applied. 


/ 


(n-1).  Thus.  Pj,  a 


^1^2 


and  0  are  under - 
"l 


The  n  X  1  vectors  u.  =  (u, .)  and  u„  =  (u_.)  are  drawn,  say,  from 

-1  U  -2  2i 

a  table  of  random  numbers.  Let  U  (n  x  2)  =  (u^ ,  u^)  and  u^  =  u^  =  0. 
Then  in  the  regression  model 


li 


=  a  +  o.  u 


1  li 


a  u 
2  2i 


+  A. 


(4.  3.6) 


a  =  a  =  a  =  0,  where  c, .  is  the  model  error  in  (4.  3.  1)  and  A.  is  the 
o  1  2  li  1 

model  error  in  (4,  3.  6).  Let  (n  x  1)  =  Then  the  least  squares 

estimates  of  ct^  and  O!^,  say 


are  equated  to  zero,  so  that 
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Q  > 


r 


[u’  i3  U’  (jTj  -  xe  j)  =  0 

where  jr ^  (n  x  1)  -  (y^ .).  X  (n  x  2)  =  0  ,  j_  (n  x  1)  =  ( 1 ), 

^2  (n  X  1)  =  (y^.),  and  9^  =  .  a).  Thus, 


A 

8 


-1 


=  (U*  X)  U'  yj 


(4.3.7) 


var  e  =  (U'  X)“^  (U'  U)“^  (U*  X)'"^  , 

1 


2 

and  the  estin^ate  of  is 


A2 

0 

e 


.  ■  I  •«/ 


(n  -  2)  , 


where  e,,  =  y^.  -  -ay^.  . 


A  _  2 

The  estimate  of  ^Jl-  is  -  y„  so  that  a  is  estimated  by 

c  2  2  €  2 

2  “  1  4  /<“-*>■  ®2i'y2i-y2'  ^^2 


IS 


estimated  by  a 


e  G 

1  2 


Z  ®2i  / 


(n  -  2). 


e  .  ,  .A  A  /A  A 

A  significant  departure  of  p  =  o  /  o  o  from  zero  would 

'l*2  '  *1  '2 


imply  the  rejection  of  the  hypothesis  that  o  .  =0.  Utilizing  an 

®l'*2 

approximate  test,  the  hypothesis  o  =  0  is  rejected  if 

12 

t  -  [(n  -  3)^^/(l  >  tj,  [(n-3)d.f.  1  . 
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where  t  is  the  upper  critical  value  of  Student's  t  distribution  with 
n  -  3  degrees  of  freedom. 


4.4  THE  ESTIMATION  OF  DIRECT  AND  INDIRECT 
EFFECTS,  AS  DISCUSSED  IN  SECTION  3,  WHEN 


Consider  the  structural  system  in  (3. 1.  1)  and  (3.  1.  2).  The 

corresponding  reduced  system,  in  sample  form,  is  given  by  (3.  1.  4) 

and  (3.  1.  5).  Since  direct  and  indirect  effects  on  y^  are  underidentified, 

when  0  ^  0,  we  again  apply  the  instrumental  variable  estimation 

1  2 

technique. 


Let  the  sample  form  of  (3. 1.  1)  be  written  as 


.  =  m,  +  t  ■»-  b 
111  1  li 


A 

+  ay_. .  +  e 

Uj 


(4.4.1) 


Then 

A  X 

-  X  (4.4.2) 

A  A  + 

♦•where  (qr  x  1)  ^  *  (q  +  r  +  2)] 

is  the  design  matrix  corresponding  to  the  model  in  (3.  1.  1),  and 

0,  =  (m,  ,  t, t,  ,  b, b,  ,  a)'.  Since  X  :s 
— 1  1  11  iq  11  iq 

singular,  choose  a  basis  of  X^,  say  X  [qr  x  (q  +  r)]  .  Then 

Kq  +  r)  x  1]  is  the  corresponding  vector  of  non-redundant  param¬ 
eters,  and  (4.  4.  2)  is  rewritten  as 
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A 


A 


Zj  -  X  0  . 


(4.4.3) 


Select  q  +  r  vectors,  say  u,,.  .  u.,.  .  u  .  ,  such  that  e. 

—1  —£  -q+r  1 


is  independent  of  ;  and  let 

U  [  qr  X  (q  +  r)  ]  =  (u^,  .  .  . ,  u^,  .  .  , ,  . 

Then  in  the  model 


=  0!  +  0!  U  +  .  .  +  G  u  +  .  .  +  0!  U  +A  , 

Toll  £  £  q+r  q+f . 


the  G's  are  zero.  The  least  squares  estimate  of  a=  (a  )  is 


a  =  (U'  U)"^  U'  Cj 


A 


Equating  a  to  0^,  0  in  (4.  4.  3)  is  given  by 


0  -  (U'  X)'^  U' 


and 

var  0  --  (U'  X)"^  (U*  U)'^  (U'  X)'"^  =  V  . 

A 

Let  z  and  V  denote  that  portion  of  0  and  V  corresponding  to 
the  non-redundant  direct  treatment  effect  estimates  on  yj.  Then  the 
hypothesis  '’’’jj  =  *  •  ’  ’’^iq  '  ®  rejected  if 

-1  A2 

(Jz)'  (JV,  J»)  (Jz)/(q  -  i)o;  >  F 
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where  is  the  upper  a  critical  value  of  the 
and  qr  -  q  -  r  degrees  of  freedom,  and 


distribution  with  q  -  1 


0 


1.  -1 
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